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‘MEMORANDUM FOR: Dtrector of Personnel 
Director of Training 


SUBJECT Stenowrlters 


1, On 22 April 1963, in company with a number of other personnel from 
the Agency, I visited the IBM research facility at Yorktown Heights, New York. 


2. One of the most interesting developments which I saw there was a 
“stenowriter". Oversimplified, it works as follows: 


A stenotypist cuts a tape which is fed automatically into a 
machine. In a matter of seconds his notes are transcribed and 
hard copy is produced by a flexowriter, all of this being done 
automatically. 


The initial copy looks like Tab A. A further refinement, also by machine and 
automatic, looks like Tab B. Only one of these machines has been built, and 
it has been sold. IBM has a proposal with the Agency to build another for the 
Foreign Documents Division for something like $600,000. Whether or not we 
buy this machine now, I am sure that it has great potential. 


3. IBM estimates that one of these machines can handle the output of 
thirty stenotypists. Obviously a key to ite operation is the availability of a 
nurnber of qualified stenotypists. The stenotypist at IBM who participated in the 
demonstration told us that it took approximately two years to become a qualifted 
stenotypist. He qualified in seventeen months and said that he knew of only one 
other person who had been able to do it in as short a time as eleven months. 


4. I should lke to request the Director of Personnel and the Director of 
Training to contact the Chief of the Foreign Documents Division and to be of any 
possible assistance in connection with the recruitment or training of stenotypists. 
I would algo like to be advised as to the general problems in connection with the 
recruitment of stenotypista and as to the speed with which they can be trained if 
we should decide now or later to conduct In-house training for them. 


DD/S:LKW:jas oigned 
Distribution: L. K. White 


O- D/Pers w/atts 
1 - DTR w/atts Deputy Director 


2. Attachments (Support) 
cc: C/FDD w/atts 
1 - DD/S chrono w/o att 
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IBM Research 


Presentation to Central Intelligence Agency 
on 
Advanced Research in Automatic Language Processing 


April 22, 1963 


10:15 AM Arrive Research Center (Coffee) 

10:30 AM Welcoming Remarks 

10:45 AM Introduction to IBM Research 

11:00 AM General Introduction to the Research Program 
in Language Processing 

11:15 AM Fundamental Linguistic Aspects of Machine 
Translation 

12:00 Lunch 

1:00 PM Applicd Research in Machine Translation 


Linyuistics 


1, Dictionary Preparation 


2. Syntactic Analysis 
1:30 PM Chinese-English Machine Translation 
1. Input Preparation 


2. Lexicon Preparation 
3. Syntactic Analysis 


2:00 PM Systems Organization for Language Translation 
2:30 PM Coffee Break 

2:50 PM Evaluation of Methods and Progress 

3:05 PM The Stenowriter Concept 


1. Automatic Transcription of Speech 
2. Conversion of Document Files 


3:25 PM Research on Print Reading 
3:45 PM General Discussion 
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DIRECT MACHINE TRANSLATION WITH LINES AND WORDS IDENTIFIED 


i ees | ns G 10 f1 1213 14 816 17 
Today is February I], 1963. We wants now Like to have a brief demonstration of the opera- 


1 2 3 6 7 8 9 
tion of the Sicnowlitey Syste for Vice President Miller. 
4.5 3) 10 12.13 


The Stehowriees Systeri is an experimental one which has been developed to automatical- 


5 ry 11 1 


ly mnenbe shorthand reeonds into proper English with all nccdssawy punctuation and format. 


I 3 & 9 10 
By the way Mr. Miller pidnee feel frec lo ae 


] 2 3 G 10 131213 14 1S 16 17 18 
The shorthand that ‘a ised is called stenourpy and this is a very o used type of key- 


5 6 8 10 12 13 14 
board shordiand: iti is used in courieoH recording, confeneiete: lectures, meeuaes for office dic- 


1 2 4°55 
tation, sometimes, and soon. 


1 2 3 4 5 6 7 8 9 10 1k 12 13 14 15 
How many Eee operators are thercin the United States. Has any one got any figures. 


t 56 7 8 
The faves Ane available I don’t have them. 


L283 4S G6 7 8 9 
Is it growing or whether it is very static. 


1 2 30°44 5 10 11 13 
It seems to be growing seconding to the president of stenograph maclines incorporated. 


I a 3 | 5 G 8 9 10 11 12 13 1415 1G 
Last year he sold an approximately fail thousand machines and this has been a gradual increase o- 


1 2 3 t 
ver the ten years. 


H 2 3 4 5 
What does the machine cot 


1 23 4 5 
About l hundred 25 dollars. 


1 2 3 4065 6 7 8 9 17) is} 
The Stenowriter aga is capableed transcribcing the Steno information into English At 


12 3 df 5 6 7 9 10 «4 12 13 15 
a very fast rate. We are now oie ently working on certain advance procedures to handle sok 


2 3 G 7 9 1011 12 13 


1 
things as automatic spelling ee correction and alsb on a program to.automatically Coreen ea : 


i 2 3 4 5 6 7 9 10 11 1213 1415 : < 16 17 18 
ited text so that a perfect copy ay. be obtained with a minute * minute mewn of human effort. 


] 2 3°04 5 6 7 8 9 1011 139 4 15 16 17 
An operation * operational system would be capable of a throughput ae of 70 to 75 re- 


23 4 5 G 7 9 10 Il 12 14.15 16 17 1819 


1 
cords * words per second. This means that one hour of speech could be transcribed in only 2 min- 


Hy 
utes. 
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3 9 ig ul 
iy ancoaei put, rks- ibl-a-fas tithe stenotype opetitob ean take kick Oxdictation around s 


1 2 
50 words pee accord: 


1 2 3 4 5 6 4 8 

Thatis correct. What was the through put. 

1 2 #3 5 7 8 9 10 YW 12 13 14 1516 

70 to 75 words per minute: On the a supposition that we have about 30 stenotypists all 


2 3 4°65 6 8.9 10 11 12 13 14 13 16 ye o18 19 
wong full time at an average re of s 50 words per minute. The system operating at that rate 


1 2 3 3 
cuckooep-up this number of stenotypists 


Le 2 8 9 10.11 12 13 1415 16 17 18 #19 
It isn ‘irinierial whether it isOin euquenee or how itis. The fact is the computer is so fast it 
7 89 7 
could handle 30 of these people ata time. 
23 #4 2 G 10 11 12 14 


Could it be doing pir else. Could the computer be solving some other problems. 


1 2 3.464 10 1] 12 1314 15.16 WW 
Just ioe to keep iB with five atenobpists its only being use a fraOtion of the time. 
1 


His this i in : 7090? 


1 2 3 45 10 12 13 14 
No, we have a spccial purpose computer which has been designed for language proc- 
2 3°64 5 6 9 10 11 12 13 14 15 iG 17 
me and it uses a experimental memory knoe as the photostore and also has some special pur- 
5 8 9 10 11 12 13 14 1516 
pase letieal lone axsocinted with it, However. we can program this Stenowriter System on a com- 
5 G 12°13 
mercially pvadable ne but, using current equipment the throughput rate would be less 
5.6 
than this figure that I quoteed. Oft hand without actually * +90 *g-thru-hkair-fuli wouldn’ t now 
how fast 
| 23 4 5 6 10 11 12 13 14 15 16 1718 
What I hear you saying at fis eee development of the art, we are thinking about a spe- 


1 2 3 6 
cial purpose computer ‘6 do what's eine done. 


1 2 
tee translation. 


1 5 6 11 12 13 14 
He i talkin g about a special language ProEcseine machine However thereisa pormers 
] 5 67 12 13 14 1516 


we haven’ t explored yet is a special memory. Control however with the logic in a standard uae 


chine. 


2 5 10 11 1213 14 15 17 18 19 
It seems to me what we are talking about here is a service department when I use the next 


1 23 4 5 6 & 9 10 11 12 13 14 15 16 1718 19 
term I don’t mean our service pareau but a service borough in a large corporation in a court sys- 


1 2 3 4 5 67 11 12 13 14.15 1G 
tem or may be ina mes that eeniee Buren would have one of these and people could dial in to it 


89 10 1112 13 14 15 1617 18 19 
through, the telephone on a schedule bags: I don’t see it as being sold toa lot of individuals. Be: 


1 78 10 11 12 
cause it looks to He like a pretty penéive computer gee og-fi. 
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LINOFILM COPY OF FINAL OUTPUT OF EDITING PROGRAM 


Today is February Il, 1963. We would now like to have a brief’ demonstration of 
the operation of the Stenowriter System for Vice President Miller. 

The Stenowriter System is an experimental one which has been developed to 
automatically transcribe shorthand records into proper English with all necessary punctuation 
and format. By the way Mr. Miller pease {cel {ree to interrupt. 

The shorthand that is used is called stenotypy and this is a very widely used type 
o: keyboard shorthand. It is used in courtroom recording, conferences, lectures, meetings, for of- 
fice dictation, sometimes, and so on. 

How many stenotype operators are there in the United States? Has aiiy vue vot 
any figures? 

The figures are available. I don’t have them. 

Is it growing or is it very static? 

It seems to be growing according to the president of Stenograph Machines Incor- 
porated. Last verr he sold approximately ten thousand machines and there has been a gradual in- 
crease OVer Lie past ten years. 

What does the machine cost? 

About one hundred twenty-five dollars. 

The Stenowriter System is capable of transcribing the Steno information into 
English at a very fast rate. We are now currently working on certain advanced procedures to han- 
dle such things as automatic spelling error correction and also ona program to automatically cor- 
rect edited text so that a perfect copy may be obtained with a minimum of human effort. 

An operational system would be capable of a throu ehput rate of 70 to 75 words 
per second. This means that one hour of speech could be transcribed in only 2 minutes. 

By throughput I mean the stenotype operator can take dictation around 150 
words per minute. That is correct. What was the throughput? 

70 to 75 words per minute. On the supposition that we have about 30 stenotyp- 
ists all working full time at an average rate of 150 words per minute. The system operating at 
thatrate could keep up with this number of stenotypists. 

It isn’t material whether it is in sequence or how itis. The fact is the computer is 
so fast it could handle 30 of these people ata time. 

Could it be doing anything else? Could the computer be solving some other prob- 
lems? 

Just trying to keep up with five stenotypists, it’s only being use a fraction of the 
time. 

Is this in a 7090? 

No, we have a special purpose computer which has been designed for language 
processing and it uses an experimental memory known as the photostore and also has some spe- 
cial purpose lexical logic associated with it. However, we can program this Stenowriter System 
on a commercially available machine, but using current equipment ti.e throughput rate would be 
less than this figure that I quoted. Offhand without actually going through it carefully I wouldn’t 
know how fast. 

What [hear you saying at this stage of development of the art, we are thinking a- 
bout a special purpose computer to do what’s being done. 

Language translation. 

He is talking about a special language processing machine. However there is a 
compromise we haven't explored yet. That is a special memory, controlled however with the logic 
in a standard machine. 

It seems to me what we are talking about here is a service department and when 
I use that term J don’t necessarily mean our Service Bureau but a service bureau in a large corpo- 
ration in a court system or maybe in a city. That service bureau would have one of these and peo- 
ple could dial into it through the telephone on a schedule basis. I don't see it as being sold toa lot 
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of individuals. Because it looks to me like a pretty expensive computer geography. 

Most of the people who operate have their own equipment. 

You mean these people? 

They may buy this service and get the transcript the next morning. 

In thinking about this I tend to put a parameter around my thinking by reason of 
the fact I'm going at it en masse immediately for serving a lot of people. 

An integrated or centralized system? 

It probably wouldn’t be the IBM Company itself. It probably would be the Ser- 
vice Bureau or a company who wanted to be the conference or court reporting bureau in Washing- 
Lon, 

Yor example, the C.I.A. has an installation where human translators dictate 
very rapidly. They translate into magnetic tape and it takes 30 full time typists to transcribe this. 
They visualize with this system they would substitute eight stenotypists witha saving of several 
hundred thousand a year, at much speed.That brings me to the next question, is this thing in a 
state of development where it could go into a governmental department where the government 
would be getting benefits from a joint study? 

We have exactly this sort of bid with the C.LA. at this time, we have proposed aut- 
omating just this function with the stcnotype system and they are very much interested. 

Would they finance the thing? 

We contribute the computer time and the programming and the research part of 
it. We also have a discussion with the Air Force, we have a large file in Omaha and they want to 
convert these into magnetic tape records for their information retrieval system. 

Instead of punching You could think of doing stenotypy. 

While he has been doing this writing the data has been going through the wire. 

What I'd like to do now is to make a few more remarks about what the Stenowri- 
ter technique could be used for. Its basic application is in reducing the delay between spoken 
word of any kind and the printed record. 

It can also be used for feeding information into data processing machines, be- 
cause stenotypy, we have shown, is approximately 5 times faster than key punching and approxi- 
mately 3 to 4 times as fast as typing. So that another important application would be for the eftfi- 
cicnt conversion of large amounts of document files into machine readable form. 

Its 5 times as fast as punched cards. Has anyone tried to build one of those 
things so that you could have other than the tape that wide to write on, you wouldn't need to. Lets 
suppose he was transcribing some bills instead of punching, suppose there was an 024 and you 
are given a hundred bills, he would get through faster. He can go right on-line, right on the mag- 
netic tape to the 1401.Things like special format, like headings. Here you can strike all the keys or 
any combination of them at the same time. Since there are 23 keys you have a total number of 
slightly over 8 million possible strokes. 

How long did it take you to figure that out? It is possible with this machine you 
don’t need the 1401, its also possible to store addition tables so that when a stenotypist finished do- 
ing a bill or an expense account if he hits a special code which says, ‘sum’, the addition tables 
would allow the sums to be printed out. We have done this. We have had in past dictionaries addi- 
tion tables for that purpose. We have had timetables for the Harmon station so that a secretary 
can say put in a time and a code and get a printout of what trains were leaving, plus or minus one 
hour around that time. We have a large capacity memory in which you-can store any tables. 

You have a tremendous number of symbols to address anything you want. He 
has many million. , 

As an example of this let’s give them the title page of a courtroom record and 
let’s also give the swearing in ceremony in a court of law. 


2 
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NEW YORK SUPREME COURT 


KINGS COUNTY 
TRIAL TERM, PART VIII 
weer ee ----- x 
NAME, : 
Plaintiff, : 
against 
NAME, 
Defendant. : 
ween ee eeeee x 


Brooklyn, N. Y., October 16, 1939. 


3efore 
IION. NAME, Justice, 
and a jury. 
Appearances: 
Aitorney for the Plaintiff. 
Attorney for the Defendant. 


Patrick J.O’Neil! Jr., 
Official Reporter. 


you God? 
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Do you swear to tell the truth, the whole truth, and nothing but the truth, so help 
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